On the non-3-colourability of random graphs. 
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Abstract. We show that for c > 2.4682, a random graph on n vertices with cn(l + o(l)) edges 
almost surely has no 3-colouring. This improves on the current best upper bound of 2.4947. 



1 Introduction 

An old problem on random graphs remaining open to this day is that of the existence and determination 
of a fc-colourability threshold, already posed in the paper [|H]] which launched the whole subject. Using 
the uniformly distributed model G (n, m) of graphs with m edges on n vertices, it reads: does there 
exist a constant such that if m ~ (c^ — e)n for some e > as n — > oo, then almost all graphs in 
G (n, m) are /c-colourable, while if m ~ (c^ + e) n almost none is? The conjectured positive answer 
has been mostly pursued in the case k = 3, the smallest value for which fe-colourabilty is an NP- 
complete problem. It is now supported by computer experiments which put C3 at about 2.3, and a 
non-uniform version is known to hold O, stating that at least an n-dependent (n) exists with the 
required property. Whether ct (n) converges remains open, but its behaviour in the large n limit is 
constrained by proven upper and lower bounds which are getting progressively tighter. 

The first lower bounds were by-products of studies on the existence of a fc-core, which is a necessary 
condition for non-fc-colourability, but understandably perhaps, even the exact threshold for the A;-core 
p(3| | yields a mediocre bound for colourability, e.g. 1.675 for k = 3. More recently, better bounds were 
achieved by analyzing simple colouring algorithms, using the powerful differential-equation techniques 
of pi]] . This gave C3 > 1.923 H, and to go beyond that, more complex extensions of Wormald's 
techniques had to be sought [||, leading to the current best lower bound of 2.015 for C3. 

Upper bounds are generally based on the ubiquitous first-moment method, starting with the 'naive' 
bound Ck < klogk as obtained by Devroye see 0. For k = 3, this is 2.71. More recent bounds have 
used the expected number of colourings with a local minimality property similar to that introduced for 
A;-SAT [||, |l8fl . In however, the property was weaker than it might have been, giving nevertheless 
C3 < 2.60. This was corrected by ||, introducing the more restrictive 'rigid colourings' but using an 
untight bound for a probability appearing in the expectation. This gave C3 < 2.522. Finally, this was 



improved to 2.495 by [15|, correcting an error in |17|], and independently by |U]] . They used a rather 
sophisticated occupancy result of |l(| to evaluate exactly the probability just mentioned. We seem 
to be reaching a point where the sheer complexity of the calculations needed to extend the method 
becomes a hindrance to further progress. 

This paper lowers the upper bound on C3 to 2.468155, i.e. by a similar amount as between the 
current best and previous best. We still use rigid colourings, but now the graphs themselves are 
restricted to a subspace sufficiently 'dense' for the first-moment (Markov) estimate still to apply in 
the limit, but that also leaves out many graphs that contribute to raise the first moment. Perhaps 
the crucial point is that the calculations are actually fairly straightforward if well taken. On the other 
hand, some side issues may be a bit tedious to check explicitly. In this extended abstract, we will 
concentrate on the main flow of the calculation. We first introduce the subspace of graphs we are 
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restricting our attention to, and show how calculating the first moment of rigid colourings in this space 
yields an upper bound for C3. We then perform the actual calculation, leading to a pair of nonlinear 
equations in two unknowns. Existence and uniqueness of a solution is then discussed, justifying a 
simple iterative procedure, the result of which is plugged into the expectation. For the above-quoted 
value of c, this gives an expectation just below 1. 



2 A large subspace of random graphs. 

Throughout the paper, m = cn (1 + o (1)) , and 2.4 < c < 2.5. (Non-colourability is known to hold 
a.s. above 2.5, and what happens below 2.4 is not our topic here.) We also denote the probability 
space by G(n,m), and, setting A = 2c, by p(x, A) or p x the Poisson probability function of mean 
A, i.e. e~ x \ x jx\. To avoid irrelevancies in our enumerations, we will consider directed graphs given 
by an ordered list of edges; multiple edges and single-vertex loops are allowed (although statistically 
insignificant). This changes nothing as to existence and value of the threshold. With these conventions, 
\G (n, m)\ = n Xn . 

First, let the random variable X denote the number of vertices of a random graph having degree 
x. We show that 9 X is concentrated around its mean which is p x : 

Lemma 2.1 There is an absolute constant C > such that for any e > 0, 

Pr (\6 X - Px \ > e) < C>/Ane~ c(e » p " )n ) (1) 

where c (£, n) = min [(£ + rf) log (1 + £/??) — £, £ 2 / (2t?)1 . In particular, linin^^ Pr (\9 X — p x \ > e) = 0. 

Proof If Ki is the degree of vertex i, the random vector {Ki) 1<i<n follows a multinomial distribution, 
which we can view as describing An indistinguishable balls (the extremities of the cn edges) being 
thrown into n bins (the vertices). A simple Poissonization argument (considering the situation where 
a Poisson number M of balls with mean An are thrown) shows that there are independent r.v.'s 
Lj, with mean A, such that the K^s are distributed as the Lj's, conditional on M = An. The sum 
W x = l{L i= a;} obeys a binomial large-deviation inequality: 



Pr 



— - Px 
n 



> e < 2 e 



-c(e,p x )n 



where c(e,p x ) = c e / Px p x , and c £ is as in [5], Corollary A. 14. Decomposing w.r.t. the values of M and 
using a standard inequality for Poisson r.v.'s yields (|]). I 



It is interesting to note that in the sequel, we do not need the full strength of Lemma [2TL but only 
the weak form of concentration stated at the end. 

Now, to e > and x max € N, we associate the set Q (e, x max ,n, m) of graphs such that for < x < 
Xmax-, the number of vertices with degree x lies between (p x — e)n and (p x — e) n. The idea is that in 



view of Lemma 24., for x max large enough and e small enough, Q (e, x max , n, c) contains 'most well- 
behaved' members of G (n, m) , and therefore that the expected number of (rigid) 3-colourings of graphs 
drawn uniformly from Q (e, x max , n, m) provides an upper bound for Pr (3— Col) , the probability of 
3-colourabiility. But, the discrepancy between the first-moment (say, rigid-colouring) bound and C3 is 
due to a 'small' (yet exponential) number of 'rogue' graphs having a huge number of colourings, many of 
which are left out from Q (e, x max ,n, m) . Consequently, the expectation just mentioned actually gives 
a better bound than the rigid colourings by themselves. Of course, in dealing with Q (e, x max , n, m) we 
have to control approximations. As for the calculations, performing them in a constrained subset of 
G (n, m) is actually in some ways beneficial, e.g. we do not need the balls-and-bins occupancy results 
of@. 

The following proposition makes precise the general idea just explained while staying at the level 
of the original probability space G (n, m) , which is technically simpler than working in the subspace 
Q (e, x max ,n, m) . The r.v. R (G) is defined as the number of rigid colorings of G G G (n, m) . 



Proposition 2.2 Let the r.v. X EXmaxnm on G(n,m) be defined by 

v (r\-l R ^ if G £ G (e,x max ,n,m) 

*e,x max ,n,m [L,) - | Q otherwise . 

If, for some integer x max and some e > 0, E [X ejXmax ,n,m\ tends to as n — ► oo, then so does 
Pr(3-Col). 



Proof Let 3-col(re, m) = {G G G (n, m) : G is 3— colourable} . Then, by Lemma 2A and since 
R (G) > 1 for G G 3-col(n, m) : 



Pr (3-Col) 



1 3— col(n, m)\ 

\G (n, m)\ 
\G (e, x max ,n, m) Pi 3-col(n, m)| 
|G (n, m)| 

|{G G G(n,m) : 3x (0 < x < x max and (G) - p x | > e)}| 



+ 



\G (n, m)\ 

- \r( l \\ l + (x maa; + l) (l) 

G(n,m) ^-^ 

G&Q(E,x max ,n,m)C\'&—co\{n,m) 

^ \rT \\ E fl(G)+o(l) 

G(n,m) 

Ge5(e,x ma2 :,n,m)n3— col(n,m) 



3 Combinatorial analysis of the expectation. 

First, let £ , XmaM be the set of vectors = (e x ) < x < Xmax m ^n ma * +1 = {0, 1/n, 2/n, 1}"— +1 
with Yjx^o* ®x — ^1 Ex™? 1 s ^ — \ an( i an l^x — Px| < £ - Since this set is of polynomial size 
< (2en) Xmax+l , and polynomial factors are irrelevant in our study, counting graphs in Q (e, x max ,n, m) 
with some property V really boils down to counting, for fixed 6 £L® £ ,x max ,n,mi graphs with V in the 
set Q {&) of G G G (e,x max ,n,m) with (6> x . (G))o< x <aw = 6 *' 

To say that G G ^ (0) is 3-colourable means that there is a partition of the vertices into vertices of 
types 0, 1 and 2 ('blue', 'red', 'green') such that there are only three types of edges: types (joining 
a blue and a red vertex), 1 (red and green), and 2 (green and blue). 

Recall also that a rigid coloring is one in which every vertex of type has edges joining it to at 
least a vertex of type 1 and at least a vertex of type 2, while every vertex of type 1 is joined to at 
least a vertex of type 2. 

Now, given such a partition and rationals O , (3 2 G I n , as well as /i x j, /j, x -, fj, x j (0 < x < x max , < j 
we count the graphs in Q (6) with: 

• /3jTO edges of type i, and: 

• the vertices of degree x being distributed as follows: for i = 0, 1,2 and < j < x, there are 
H x j9 x n vertices of degree x and type i, each of which has j (type-i) edges joining it to vertices 
of type i + 1 (mod3) , 

which are rigidly coloured. Let Z (0, [3, /x, n, m) be the number of such graphs. 
Note that the colouring being rigid says exactly that fj, x = fx x x = and fj, x = 0. Also, any 
vertex of degree must be of type 2, and any vertex of degree 1 must be of type 1 or 2. Accordingly, 



we will be considering only n x . for 2 < x < x rnax , 1 < j < x — 1, and n x ■ for 1 < x < x max , 1 < j < x. 
For /x^ j, no restriction applies. 

We first choose, among m empty templates representing the edges, those corresponding to each 
type of edge, and within each edge template, the colour of each vertex (recall that in our model the 
edges are directed). This can be done in A n (/3,m) ways, where 

A n {/3,m] 



(J3 m)\(p im )\(l3 2 m)\ " 

Second, we attribute each vertex a type. Within each group of a x -Q x n vertices of degree x < x ma x, 
we comply with the above-stated requirements. The remaining rre vertices, with r = 1 — X^=(T ^> 
will be those of degree > x max . The number of ways this can be done is: 

B n {9, fjt,n) ■■ 



(9 ny.(9 1 n)l..(9 Xmax n)\(rn)\ 

(6 s n)\ 



Finally, we effectively fill the template locations with the vertices. of various types. Let M n {9, /3, /x, re, m) 
be the number of possibilities here. To begin with, consider the vertices of high degree (> x max ). They 
are to occupy an places, with a = (A — X^=cT x @x) (1 + (1)) and o — ► as x max — > +oo and e — > 0, 
uniformly in 9. The ways to assign them are certainly less than 

rj(6,n,m) = | ) (™) <jra . 
\anj 

The (3 Q m type-0 edge templates contain 2f3 1 m vertices, /^m of which are already known to be blue, 
and ftm red. Let us say that among the blue ones, a®m are of high degree and already assigned, and 
a\m among the red ones. The number of ways to fill the still-free places in the type-0 templates is, 
then: 

M [(/W8H' [(lo-^)H 

n::rn;:}j^ M ns%i(^-j)^ " 

We fill similarly the type-1 and type-2 templates, with 

[{13,- a\) m]\ [{13,- a\) m]\ 



x,3 



Mi 



and 

[{/3 2 -4)m]\ [{(3 2 -al)m]\ 



x—1 /_ -■MA 4 " -Qx.n 



r& uu^ xn n : 2 n U (* - j) 



possibilities, respectively. By construction ctq + o"5 + crj + o"3, + o\ + o"q = o". All in all, 

M n {9, (3, //, re, m) < M0M1M2V {9, re, m) , 

Z{0,(3,fi,,n,m) = A n {(3,m) B n {9, fj,,n) M n {9, (3, fx,n,m) 

< A n (J3, m) B n {9, n, n) MqMxM 2 t] {9, n, m) . 

and, for the expectation referred to in Proposition [E^ : 

E [X £ , Xmax , n , m ] < ^ {2en) Xma * +1 ^ A n (/3, m) £ n (0, /x, re) M„ {9, (3, fi, n, m) , (2) 

where (3 and /x in the sum are constrained by a relationship which we shall examine later. 



4 Asymptotics. 



Using an inequality version of Stirling's formula and/or upper bounds on multinomial coefficients 
derived from it, it is seen that (with the convention that an empty product is equal to f): 



A n (Am) V "< (l + o(l)) 



(l + o(l)) 



rx-l mS 



1 Ml. 



2 A 



1 lz=0 I ^ 1 lj=l Mr j 1 lj=l May 1 lj=0 M X) . 



-Mo /n < (l + o(l)) 
-M^ < (l + o(l)) 

< (l + o(l)) 
so that, using j! (x — j)\ = x\/ (p : 

(7W A^i^2) 1/n < Co M ■max 



(/Vn/e)^-^) 



(/? 2 cn/e)( 2 ^^-' T °) c 

ii:;;t (n; n.r : <n; '*•<• .^ /,; - 



C (cn/e) (2 - CT)c ^^ ^(^ij+M^Je. 



n 



.X=0 X ' 



IE=0 3=0 

where lim E -+o,x max -*+oo Co (^w) = 1, uniformly in 0, and that 

(^(/3,m)i? n (0,/x,n)r ? (0,n,m)^o-Mi^2) 1/n < J^^T'i x 

2 C ($$&)\cn/e) 



2c 



n%max 1 
x=0 11 j=0 



-i ifi -. i/ 1 



(!) 



(!) 



_ ,,2 



(!) 



where Co nas the same property (and by convention 0° = 1). 

Further, we can get 0-free estimates where 6 X is replaced throughout byp x , at the price of additional 
factors which all tend to 1 as e — > and x max — ► +oo. In the end result, the sum in @) being of a 
polynomial number of exponentially-behaved terms, 



Ci (e i %max ) 



2 C EE=o" (arte 



(3) 



max • 
Am 



1 [^'max 1 1"^' 

llx=0 llj=0 



(!) 



(!) 



(!) 



,2 NPi' 



where lim £ ^o. 



Ci (£,x max ) = 1, and the max is under constraints to which we now come. 



5 The optimization. 



We set: 

x-l 



3=1 3=1 3=0 



and note that for < x < x 



max i 



a° x + a\ + (4 = 1, (4) 

and that ckq = $ = 1, = /ig o = a o = Mo,o = a i = Mi,o = Ml 1 = a ^ so ' Ml 1 = a i- 

Introduce the reduced blue, red and green spreads </? , ip 1 and <^ 2 of the coloured graph. These are 
the quotients by An of the numbers of places of the corresponding colours in our filled graph template 
(excluding the places occupied by vertices of degree > x max ); namely, 

3>max %max % 

<Pi = A -1 £ xp x a x = A -1 £ xp x £ (5) 

x=0 x=0 j=0 

Note that (3 + (3 2 = 2i Po + Vo (e, x max ), where lirn £ ^o,a: roM: ->-+oo (e, x max ) = 0. Similarly, (5 + (3 1 = 
2ipi + rji, Pi + (3 2 = 2ip 2 + r/ 2 , and tp + ipi + (p 2 = 1 + r/ 3 , (actually r/ 3 depends only on x max , see 
below). Thus, f3 = 1 — 2(p 2 + r] 4 , and similarly for f3 1 and (5 2 . From this we see that (3) still holds 
with 1 — 2ip i replacing /3j +1 ( mod3 ), and the maximization on fi alone (subject to < (p t < 1/2), under 
the penalty of a slightly larger (± (e,x max ) which still tends to 1 in the limit of e — > 0, x max — > +00. 
Also, the max may be extended to 11 being a vector of reals in [0, 1] . We are therefore looking to solve 
the problem: minimize the function 



/ (A*) = £ Px £ 

x=0 j=0 



/' \,.„^ x >i 1 ,,1 ^„„ fJ ' x >^ 1 ,,2 i„„^,3 



1 ^a;,3 , 1 1 ^,3 2 1 

Mx,3 — + H sJ log -py + /^j log 



3> \j/ y 3> 



+c (1 - 2^ ) log (1 - 2^) + c (1 - 2^) log (1 - 2y x ) + c (1 - 2^ 2 ) log (1 - 2<^ 2 ) 

(where by convention OlogO = 0), subject to the constraints = C x = a x + a x + a x — 1 for < 
x < x max , fi > 0, < <Po,tpi,tp2 — an d n x = n x x = n x = (so that these are not really 
variables, and we view fx as a vector in ^x max (x max +i)/2 anc j not -^3(x max +i)(x max +2)/2y Setting 
(fi m i n = 0.26 and (f max = 0.4, simple calculations indicate that within our chosen c domain, the 
expected number of (unrestricted) 3-colourings such that (p < (p min or ip Q > (p max tends to zero 

3x (x +l)/2 

anyway, and similarly for </p 1 and <p 2 This means that we can restrict n to the set Li =M + maxy max " n 
{^i : (/'min < < ^maxi^ = 0, 1, 2} . W is not open, but it can be seen directly that a vector fi with a 
null coordinate (recall that we have excluded the coordinates required to be null) cannot be a local 
minimum. So we can replace M+ with ]0, +oo[ and minimize on the resulting open set T>, where 
differential techniques can be used. 

Since the constraints above are linear, the classical method of Lagrange multipliers applies 
without having to check for some constraint qualification such as linear independence of gradients 
(which is true, though). Associating a Lagrange multiplier A x to the constraint C x = 0, a necessary 
condition for a local minimum is 

%max 

= V/+ £ A X VC X . 

x=0 

This gives, for i = 0, 1, 2, with j ^ {0, x} if i = 0, and j ^ if i = 1 : 



/ ! - >..// (>X P /V/- !>.<■ ■ 1 -r; \j) B{x,<p) 



which we plug back into C x = to find that the denominator is 

B (x, tp) = max (0, 2 X - 2) (1 - 2ip Q f + (2 X - 1) (1 - 2 ¥ > 1 ) x + 2* (1 - 2<^ 2 f . 

With the (^j defined by (5), our necessary condition (||) is a rather hopeless system of 3x max (x max + 1) /2 
nonlinear equations in as many unknowns. However, its peculiar form means that if we view (|], 5) 
as a system of equations in /x and (p, there is an 'easier' way to solve it, namely eliminating the 
/i's by plugging the r.h.s's r % x ■ (ip) of @ into (5). Noting further that the constraints C x = imply 
ip 2 = U (x max ) — ip — ipi, with U (x max ) = A -1 X^x=(T we obtain a much nicer necessary condition, 
namely two equations in the two unknowns ip Q and (fi : 

= \ip Q - ^ xp x ^2 r x j (ip , Vl , U (x max ) -(p - (fx) , (7) 

x =0 j=0 

= \<Pl ~ XPx r *j (^0. Vl, U ^max) -<P0~ <P\) ■ (8) 

x =0 j=0 

(In practice, we take x max sufficiently large so that U (x max ) can be replaced by 1.) Having solved (7, 
8), we recover fi from (^). 

Of course, even a system of two nonlinear equations can be unmanageable, but in this case a 
change of variables ip = yo + y\, ip^ = yo — y\ turns ([?], 8) into 

K l (y ,y 1 )=0, {) 

where Kq and K\ are functions that, within our restricted range p min < <p^ < p max , are found to be 
monotone in each variable separately, with partial derivatives 

8K dK dK x dKx 

«- a >o, ir ii >o, -^>o, ir ^<0. 

oyo ayi dy dyi 

Since along K, L = 0, we have dyi/dyo = — (dKi/dyo) / (dKi/dyi) ,it follows that y± decreases in yo 
along Kq = 0, while it increases along K\ = (see Fig. 1). Further, for the smallest attainable value 
of yo iu our range, the solution in y\ of Kq = is larger than that of K\ = 0; while the reverse holds 
for the largest attainable yo- Thus, the existence and uniqueness of the solution to (9), and therefore 
also to ([7], 8), simply follow from the intermediate value theorem. Since a minimum of / (//) in U 
must exist (corresponding, in the limit, to the maximum term of E [X e XmaX!nm }), and since there is 
no local minimum on the boundary of U, it must be a point of null gradient in D, so this is it. 




Figure 1: The implicit functions defined by (10). The first 
equation defines the decreasing function, the second the in- 
creasing one. 



6 The numerical calculations. 



We now evaluate our modified estimate (||) with the ip i and [i % x ■ derived from (|7|, 8) and (g) together 
with fJ. x0 = fJ, xx = a4 o = 0- We have, for i = 0,1,2 (since the factors corresponding to fi 
evaluate to 1 on both sides of the first equality): 



nn 

x=0 j=0 



(f) J 



nn 

x=0 j=0 



(1 ~ 2^) 3 
B{x,<p) 



n 

x=0 



(l-2yj 3 



so, taking account of (0), i.e., C x = 0, 



nn 

z=0 j=0 



/4i 

(■) 



(•) 



xw, [(1 - 2^T l (i - 2^) a * (1 - 2^ 2 y 



xp.r 



x=0 



(1-2^)^(1-2^(1-2^ 



B(x,tpY 

X(p 1 



Px 



Finally, since lim Xn 



0«W = (A/e)\ 
J/n ^ „ ,„ „ \I&B(x,<pf* 



; Xmax ) 



(1 - 2v9 ) (1 - 2 ^ )c - Vo (1 - 2<^ 1 ) (1 ~ 2 ^ )c ~ Vl (1 - 2v9 2 ) (1 ~ 2 ^ )c ~ V2 
(1 - 2^, 



-Vo)'- f 1 _ ■) - '. ; i) c (1 _ 2( / 9 2 ) (1_4 ^ 2)c 



2 C 



where lim e ^o,a' maa; ^+oo C2 ( e > %ib) = 1) and an explicit C2 ( e > %(u) can be obtained by tracking down 
the successive approximations made. Recall that ip 2 is U (x max ) — (p — (pi with lim a;ma!I ,_ 4+00 U (x max ) = 
1. 

The simple monotonic behaviour described above makes it possible provably to solve the system 
(|7|, 8) using a very basic iterative procedure that starts from an angle of the admissible rectangle 
in (fo,(fi and spirals towards the solution. Doing so for c = 2.468155, a sufficiently large x max and 
small e, one finds linin^ooE [Ie.^,,^^] 1 '" < 0.99999995. By monotonicity, this value of c, then, is 
an upper bound for C3. 
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